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Abstract 

Let tt„ be the n n -element set consisting of functions that have [n] as 
both domain and codomain. Since Q n is finite, it is clear by the pigeonhole 
principle that, for any / £ Q n , the sequence of compositional iterates 

/,/ (2 \/ (3 \/ (4) ... 

must eventually repeat. Let T(/) be the period of this eventually periodic 
sequence of functions, i.e. the least positive integer T such that, for all 
m > n, 

A closely related number B(/) = the product of the lengths of the 
cycles of /, has previously been used as an approximation for T. This 
paper proves that the average values of these two quantities are quite 
different. The expected value of T is 



i^T^ = op *o;/X(i + o(i)) 



n» f ^ V V log' n 

where fco is a complicated but explicitly defined constant that is approxi- 
mately 3.36. The expected value of B is much larger: 



X - J2 B(/)=expf§ ^(l + o(l)) 



n 

/en 



1 Introduction 

Let fl„ be the n n -element set consisting of functions that have [n] as both 
domain and codomain, and let denote / composed with itself t times. 
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Since fi n is finite, it is clear (by the pigeonhole principle) that, for any 
/ £ fin, the sequence of compositional iterates 

/,/ (2 \/ (3) ,/ (4) ... 

must eventually repeat. Define T(/) to be the period of this sequence, 
i.e. the least T such that, for all m > n, 

,(m+T) _ j(m) 

We say v £ [n] is a cyclic vertex if there is a t such that p*' (v) = v. The 
restriction of / to cyclic vertices is a permutation of the cyclic vertices, 
and the period T is just the order of this permutation, i.e. the least 
common multiple of the cycle lengths. 

Harris showed that T(/) = ei log2 for most functions /. To 

make this precise, let P n denote the uniform distribution on fi„; P n ({f}) = 
n~ n for all /. Define a n = | log 2 n,let b n = -A= log 3//2 n, and let <j>(x) = 




sing Erdos and Turan's seminal results [9], Harris proved 



— oo 

Theorem 1 (Harrisflffj) For any fixed x, 

^p/JogT-o^ \ =4>{x) _ 

R-KX) \ O n J 

Comment: Harris actually stated his theorem for a closely related ran- 
dom variable 0(/) = the number of distinct functions in the sequence 
/, f( 2 ^ , , . . . . However it is clear from his proof that Theorem [T] holds 
too. In fact, it is straightforward to verify that, for all / £ Sl n , |0(/) — 
T(/)| < n. Such inequalities have been proved by Denes [8]. 

Let B(/) be the product, with multiplicities, of the lengths of the cycles 
of /. Obviously T(/) < B(/) for all /, and for some exceptional functions 
B(/) is much larger than T(/). For example, if / is a permutation with 
n/2 cycles of length 2, then B(/) = 2 n/2 , but T(/) = 2. In fact, the 
maximum value T can have is e v"^°i™( 1 +°( 1 )) 20 . However, for most 
random mappings / £ B(/) is a reasonably good approximation for 
T(/). For example, the following proposition follows easily from results of 
Arratia and Tavare [3|. 

Proposition 1 Let £ n ,n = 1,2,... be any sequence of positive numbers. 
There is a constant c > such that, for all sufficiently large n, 

„ „ „ , „ . c log n flog log n) 2 
P n (logB - logT > £ n ) < a v s a ' . 

Although logB(/) and logT(/) are approximately equal for most func- 
tions /,the set of exceptional functions is nevertheless large enough so 
that the expected values of the two random variables B and T are quite 
different. The following theorem will be proved. 

Theorem 2 E n (B) = exp (§#n + 0(n 1/6 )) . 
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To state a corresponding theorem for T, we need to define a constant fco- 

oo 

First define I = J log log( 1 _^_ t )dt, and define (3q = V&I. This constant 
o 

first appears first in [13] where it is proved that the expected order of a 
random permutation is exp y^os/nj log n(l + o(l))^ . Also define 



fco = ^(37) 2/3 . (1) 



We prove 
Theorem 3 



E n (T) =exp ^ ^__ (1 + (1)) 

The paper is organized as follows. Section 2 reviews some relevant 
graph theory. In particular, we need Renyi's asymptotic approximations 
for the expected length of the cycle in a random / that is chosen uniformly 
from those / g fi n that have one cycle. In section 3, we apply Hansen's 
deconditioning argument so that estimation of E„(B) can be reduced to 
a simpler problem of estimating the coefficients of a particular generating 
function E Z (B). In section 4, we use a Tauberian theorem of Odlyzko to 
estimate the coefficients of E Z (B) and complete the proof of Theorem [2] 
In section 5, estimates for E„(T) are deduced from earlier work on the 
expected order of a random permutation. 



2 The cycle length for a connected map. 

If / £ f2 n , let Df be the directed graph with vertex set [n] and a directed 
edge from v to w if and only if /(«) = w. Provided we count loops and 
cycles of length 2, the weak components of Df each have exactly one 
directed cycle. Let U n C fi n be the set of functions for which Df has 
exactly one weak component. For / & U„, let L n (f) be the length of the 
unique cycle / has. Let k„ be the expected value of L n for a uniform 
random / G U n - We will need asymptotic estimates for K n . Apparently 
Renyi did this calculation in [22] : see page 366 of Bollobas [7|. 

Lemma 4 (Renyi) 
and 

Additional background material on random mappings can be found in 
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3 Deconditioning 

The proof of Theorem [3] relies on a probabilistic technique that Jennie 
Hansen developed for random mappings in [14] . Let P n be the uniform 

oo 

probability measure on Q n , and let Q — [J Q n . For any / 6 f2 and any 

n=X 

positive integer d, let ay(/) = the number of d- vertex weak components 
that the graph Df has. Given z £ (0, 1), and A C f2, define -Pz(^4) to be 
the probability that the following procedure selects an element of A: 

• Choose independently the numbers ax, ad, Q3, . . . , where ad has a 



d-l k 

Poisson distribution with mean \d — ( e~ d ^ 4r 

fc=0 



• Let v = y~] da 



d. 

• Pick / uniform randomly from among all / € fi„ having compo- 
nents of size d,d = 1, 2, 3 ... . 

This defines a probability measure P z on fi. For any 'J/ : SI — > 5ft, let 
E z {^>) = denote its P z expected value, and let = 

/ 

E g {fff\v = n) = — sr X] Hansen proved the following theorem 

Theorem 5 ( \14j ) If ^ IS an V function on Q that is determined by the 
sequence ax, a%, . . . , then 

n n 1 — 2 

00 

where H(z) = ^ ^-(z/e) m and " \[z n "^\"means "the coefficient of z n in 

m=0 

The point of this construction is that E z (ty) is much easier to compute 
that E n {^) because the variables ad,d — 1,2,... are independent with 
respect to P z and not with respect to P n . It is quite analagous to Shepp 
and Lloyd's work on random permutations [24] . 

For the problem under consideration, we let Bd = the product of the 

lengths of cycles in d— vertex components ( If / has no d- vertex compo- 

00 

nents, then Bd = 1). Thus B = FJ Bd is a product of P z -independent 

d=x 

random variables, and 

00 

E Z (B) = Y[ E z (Bd). (2) 

d=X 

Evaluating the d'th term in this product, we have 

E z {Bd) = ^P z {ad = m)E z {Bd\a d = m). (3) 

771 

Note that 

E z (B d \a d = m) = (4) 
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where Kd = average length of the cycle for a uniform random connected 
map on d vertices. Therefore 

E z {B d ) = e- x * +XdKd . (5) 

Combining ©, (J2J, and Theorem [5] we get 



E n (B) = ^-[^H^-A-exp $>«,(*, - 1; 

\d=l 



(6) 



To simplify notation later, define Cd so that CdZ d — \d{Kd — 1), i.e. 

-( e ^ETT) = ^ + °(l)- (7) 



(^d - 1) , -iv-ii' 
c d = - — " x 



d-l . 

-d 



( Further information of the approximation e d ^ = | + 0-j=) can 



fc=0 



OO 



be found in [TO].) Let /i(m) = [[z m ]] yr^ ex P ( E c dZ d )J , and recall that, 
for all m, [[z m ]]tf(z) = Then 

„ n!e" ^ , . (n-m)"- m 

In the next section we derive an asymptotic formula for log fJ,(n), and then 
use it to estimate the right side of |(HJ. 



4 Tauberian theorem 

In this section, we derive an asymptotic formula for log /i(n). Let a„ 
fj,(ri) - ^(n — l), and for s > 0, let F(s) = J] a n e~ ns = exp ( J] c d e~ ds 



n \d— 1 



Also let gr(s) = log F(s) = ^ c d e~ . To apply Odlyzko's Tauberian 

d=l 

theorem|21]. we will need asymptotic estimates for the ji'th derivative 

oo 

9 U) {s) = E {-dyc d e- ds for j = 0, 1, 2, 3. From Q, we have 



d=l 



Hence, as s — > + , 



(-iyr( J + i) t ^ i 



r^ + o(-). (10) 



5 



"Rankin's method" (see Proposition 1 of [21]) is the very useful observation 
that, for any s > 0, 

Ufa) < e nB+3{s) . (11) 

In particular, for s — - \j 3 and j = 0, we can combine (|ll|l and ()10|) to 
get 

/x(n)<e X p(^n 1/3 +0(l). (12) 

Now Odlyzko's Tauberian theorem can be used to derive a lower bound 
for fj,(n). The remainder of this paragraph is a straightforward verification 
of the conditions of Theorem 1 in [21]. Let s* = s*(n) be chosen so as to 
By calculus and (|10[) , 



' = a .(l + O(V50). (13) 



2n 2 / 3 



It is clear that s* = o(l). We can "boostrap" twice to get a better estimate. 
First, by replacing 0{^/s7) with o(l) in (fT3|l. we get s* = 0(^573). Putting 
this rough bound back into (|13p again yields 

^ = ^73(1 + ^)). (14) 
Now let A n = g"(a»). By dTDJ) and (fTI)) . 

A,=3n 5 / 3 (1 + 0(i)). (15) 
Then, by (TO), ([T4")|. and (JTSj), we have 

|/'( S ,)| ~15n 7/3 =o(^ 2 ). (16) 

Having verified the conditions of Odlyzko's theorem, we conclude that, 
for all sufficiently large n, 

fi(n) > F(s,) exp fns, - SOsUl/ 2 - WO] = ^+°<" 1/6 ). (17) 



By Stirling's formula, 



m!e m ^2lrm 
Finally, putting (JTHJ and (JTZj) and JT2J> back into ©, we get 
Theorem 6 E n (B) = el «=+0(" 1/6 >. 

5 Order 

The main goal in this section is the proof of Theorem [3] the estimate for 
the average period E„(T). However first, for comparison and perspective, 
we prove Proposition [1] concerning the typical period, that was stated in 
the introduction. 
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Proof: Let Z(f) = denote the number of cyclic vertices / has. By the 
Law of Total Probability, 

P n (log B — log T > l n ) — P n (Z = m)P„ (log B - log T > t n \Z = to). 

m 

(19) 

In the proof of Theorem 8, page 333 of ,3 , Arratia and Tavare computed 
the expected value of log B — log T given the number of cyclic vertices: 
£„(logB - logT|Z = to) = 0(logm(loglogm) 2 ). (Note: their use of the 
notation P n is not consistent with the notation in this paper.) Therefore, 
by Markov's inequality,there is a constant c > such that, for all I > 0, 

P„(logB - logT > l\Z = m) < Cl0gm(1 ° gl ° gm)2 < Cl0gn(1 ° gl0gn)2 . 

(20) 

Putting (|20p back to the sum (|19p . we get the proposition. □ 

Let Sym([n]) C Q n be the n! element set of bijections, and let M n = 
E n (T\Sym([n])) = Yl T(cr) be the average of the orders of the 

o-£LSym([n}) 

permutations of \n\. Given the set Z of cyclic vertices, the restriction of 
a random / to Sym(Z) is a uniform random permutation of Z. Hence 

71 

En(T)= ^P„(Z = m)M m . (21) 

m— 1 

Two helpful theorems area make it possible to estimate this sum. The 
first is a simple formula for P n (Z — m) that appears in [17] and is at- 
tributed to Rubin and Sitgreaves. 

Theorem 7 P„(Z = to) = -, m 

The second helpful theorem is an estimate for M m . Using Erdos and 
Turan's Tauberian theorem[9. , Richard Stong proved in [25] that 

Theorem 8 logM m = ft y/m/ log to + 0( m ) ■ 

With Theorems [7] and [8] available, we can prove prove 



Theorem 9 log£ n (T) = fc (l + o(l)) . 

Proof: Define ao = and let mo = the closest integer to ao ^ n . 

For the lower bound, simply let m — mg in the trivial lower bound 
E„(T) > P„(Z = m)M m .Then, by Theorem^] TheoremE] and Stirling's 
formula, E n (T) is greater than 



exP f +o(^f) + pJ^ +0( y-5 iogiog -s 



2n n? y logm^ log 



, fc n 1/3 . „,n 1/3 loglogn 

ex P ; 2/3 + ° i 7/6 ) 

Joe 273 ?! log 7/b n 



For the upper bound, suppose e > is a fixed but arbitrarily small positive 

n! 

(n — m)\n 11 



number. Define f3 e = /3 + e, and w e (m) = ",' m _ t e fe logm . By 
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Theorem El M m < e /W m /i°g™ for all sufficiently large m. Therefore, for 
all sufficiently large n, 

E n (T) < nmaxP„(Z = m)M m < maxw E (m). (22) 

For 6 < m < n, let G„, e (m) = logui e (m). If we write (n — m)\ — 
r(n + 1 — m), then G n , t (x) is twice differentiable for all real numbers x 

with 6 < x < n. Let ^(y) = 'wffi be the logarithmic derivative of the 
Gamma function so that the first two derivatives of G n ,e are 

G n , e (x) = *(n + 1 - z) - logn + - /l (1 - -!-), (23) 

2y/x log x log x 

and 

GC,.(x) = -tt'(n + 1 - x) + § ( f ■ (24) 

4 :r 3 / 2 log s/ i 

. OO 

It is well known g] that * (y) = J] j^ps > 0. Thus both terms in (f24|) 

are negative, and we we have G n ^(x) < for 6 < x < n. Let x* n e be 
the unique solution to G n ^(x) — at which G„, e attains its maximum. 
We need to estimate x* n e , and then use that estimate to approximate 

Gn,e \<^n,e)' 

Define m*, e = /J^'' 3 \/3/8 n~XT73 i and l et ^ S (0, |). In order to prove 
that 



(1 - 5)m* j£ < x*, e < (1 + 5)m*, e , (25) 

it suffices to verify that G n>€ ((l ~ S)m*) > and G„, e ((l + 5)m*) < 0. It 
is well known [4] that 

*(y)=logy + 0(i). (26) 
Putting ([26} in p3)l. we get 



- -=4|={l-(l^ + 0(^))). (27) 
VI - (5^8?! log n L log" J 

To determine the sign, note that 1 — (1 — S) 3 ^ 2 > |<5. Hence G„ ]( ,((l — 
<5)m«) > 0, and we can even allow S — 8 n to depend on n so long as 
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oo. By similar reasoning G n e ((l + 5)m*) < 0. Therefore a:^ e 
mJU(l + 0( "t° E „"' 2 )- But then > b y Stirling's formula, G„, e « E ) 
££(1 + 0(1)), where 



log log n 



(P 2/s {/3/8) 2 , „ (3 2/3 ^/3/8 
fe = 2 +/3e V 2/3 ' 

The theorem now follows from the fact that e was an arbitrarily small 
positive number, and lim k t = ko. □ 

e-»0+ 
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6 Discussion 



The calculations in this paper depend heavily on the fact that the proba- 
bility measure P„ is uniform. A separate paper considers Ewens-type dis- 
tributions where the probability of a permutation or mapping is weighted 
according to the number of components it has [5], [15] .[25]. I do not know 
if the results in these papers can be extended to more general indepen- 
dent choice models such as those considered by Jaworski in |18] or the 
p- mappings of Aldous and Pitman, [TJ,[2]- 
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